klotz: fine tuning*


  1. Introduces proxy-tuning, a lightweight decoding-time algorithm that operates on top of black-box LMs to achieve the same end as direct tuning. The method tunes a smaller LM, then applies the difference between the predictions of the small tuned and untuned LMs to shift the original predictions of the larger untuned model in the direction of tuning, while retaining the benefits of larger-scale pretraining.
    2024-05-11 by klotz
  2. - GitHub repository for a tutorial series called "0 to LitGPT."
    - Provides an overview of how to get started with LitGPT, an open-source library from Lightning AI for pretraining, fine-tuning, and deploying large language models.
    - The repository page also links to the LitGPT project's code, issues, and pull requests.
    2024-03-28 by klotz
  3. - Discusses the use of consumer graphics cards for fine-tuning large language models (LLMs)
    - Compares consumer graphics cards, such as NVIDIA GeForce RTX Series GPUs, to data center and cloud computing GPUs
    - Highlights the differences in GPU memory and price between consumer and data center GPUs
    - Shares the author's experience using a GeForce RTX 3090 card with 24 GB of GPU memory for fine-tuning LLMs
    2024-02-02 by klotz
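
The proxy-tuning entry (item 1) describes shifting a large untuned model's predictions by the difference between a small tuned and a small untuned model. A minimal sketch of that arithmetic for a single decoding step follows. It assumes all three models share a vocabulary and that the shift is applied to next-token logits before the softmax; the function names and the three-token toy logits are hypothetical and are not taken from the bookmarked page.

```python
import math


def softmax(logits):
    """Convert logits to probabilities (numerically stable)."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def proxy_tuned_logits(large_base, small_tuned, small_base):
    """Shift the large untuned model's logits by (small tuned - small untuned).

    All three lists hold next-token logits over the same vocabulary
    (an assumption of this sketch).
    """
    if not (len(large_base) == len(small_tuned) == len(small_base)):
        raise ValueError("all logit vectors must cover the same vocabulary")
    return [b + (t - u) for b, t, u in zip(large_base, small_tuned, small_base)]


# Toy example with a three-token vocabulary (made-up numbers).
large_base = [2.0, 1.0, 0.0]   # large untuned model prefers token 0
small_base = [0.0, 0.0, 0.0]   # small untuned model has no preference
small_tuned = [0.0, 2.0, 0.0]  # tuning taught the small model to prefer token 1

shifted = proxy_tuned_logits(large_base, small_tuned, small_base)
probs = softmax(shifted)
best_token = max(range(len(probs)), key=probs.__getitem__)
```

In this toy case the tuning signal from the small model is enough to move the large model's top choice from token 0 to token 1, while the large model's own logits still contribute to the final distribution.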

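The consumer-GPU entry (item 3) turns on how much model fits in 24 GB of GPU memory. The rough estimate below is a sketch, not the bookmarked author's method. It assumes 2 bytes per parameter for 16-bit weights and a commonly quoted rule of thumb of about 16 bytes per parameter for full fine-tuning with Adam in mixed precision (weights, gradients, and optimizer state), and it ignores activations and framework overhead. The 7-billion-parameter model size is a hypothetical example.

```python
def memory_gb(n_params: float, bytes_per_param: float) -> float:
    """Approximate memory in GB (1 GB = 1e9 bytes) for n_params parameters."""
    return n_params * bytes_per_param / 1e9


def fits_on_gpu(n_params: float, bytes_per_param: float, gpu_gb: float = 24.0) -> bool:
    """True if the estimate fits within gpu_gb, ignoring activations and overhead."""
    return memory_gb(n_params, bytes_per_param) <= gpu_gb


N_PARAMS = 7e9              # hypothetical 7B-parameter model
FP16_WEIGHTS_BYTES = 2      # 16-bit weights only
FULL_FINETUNE_BYTES = 16    # assumed rule of thumb: weights + gradients + Adam state

weights_only_gb = memory_gb(N_PARAMS, FP16_WEIGHTS_BYTES)
full_finetune_gb = memory_gb(N_PARAMS, FULL_FINETUNE_BYTES)

print(f"16-bit weights:   {weights_only_gb:.0f} GB, fits in 24 GB: "
      f"{fits_on_gpu(N_PARAMS, FP16_WEIGHTS_BYTES)}")
print(f"full fine-tuning: {full_finetune_gb:.0f} GB, fits in 24 GB: "
      f"{fits_on_gpu(N_PARAMS, FULL_FINETUNE_BYTES)}")
```

Under these assumptions the weights alone fit on a 24 GB card but full fine-tuning does not, which is one reason memory-saving approaches matter on consumer hardware.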